智能论文笔记

Statistical Machine Translation for Indic Languages

Sudhansu Bala Das , Divyajoti Panda , Tapas Kumar Mishra , Bidyut Kr. Patra

分类：自然语言处理

2023-01-02

Machine Translation (MT) system generally aims at automatic representation of source language into target language retaining the originality of context using various Natural Language Processing (NLP) techniques. Among various NLP methods, Statistical Machine Translation(SMT). SMT uses probabilistic and statistical techniques to analyze information and conversion. This paper canvasses about the development of bilingual SMT models for translating English to fifteen low-resource Indian Languages (ILs) and vice versa. At the outset, all 15 languages are briefed with a short description related to our experimental need. Further, a detailed analysis of Samanantar and OPUS dataset for model building, along with standard benchmark dataset (Flores-200) for fine-tuning and testing, is done as a part of our experiment. Different preprocessing approaches are proposed in this paper to handle the noise of the dataset. To create the system, MOSES open-source SMT toolkit is explored. Distance reordering is utilized with the aim to understand the rules of grammar and context-dependent adjustments through a phrase reordering categorization framework. In our experiment, the quality of the translation is evaluated using standard metrics such as BLEU, METEOR, and RIBES

translated by 谷歌翻译

Improving Multilingual Neural Machine Translation System for Indic Languages

Sudhansu Bala Das , Atharv Biradar , Tapas Kumar Mishra , Bidyut Kumar Patra

分类：自然语言处理

2022-09-27

机器翻译系统（MTS）是通过将文本或语音从一种语言转换为另一种语言的有效工具。在像印度这样的大型多语言环境中，对有效的翻译系统的需求变得显而易见，英语和一套印度语言（ILS）正式使用。与英语相反，由于语料库的不可用，IL仍然被视为低资源语言。为了解决不对称性质，多语言神经机器翻译（MNMT）系统会发展为在这个方向上的理想方法。在本文中，我们提出了一个MNMT系统，以解决与低资源语言翻译有关的问题。我们的模型包括两个MNMT系统，即用于英语印度（一对多），另一个用于指示英语（多一对多），其中包含15个语言对（30个翻译说明）的共享编码器码头。由于大多数IL对具有很少的平行语料库，因此不足以训练任何机器翻译模型。我们探索各种增强策略，以通过建议的模型提高整体翻译质量。最先进的变压器体系结构用于实现所提出的模型。大量数据的试验揭示了其优越性比常规模型的优势。此外，本文解决了语言关系的使用（在方言，脚本等方面），尤其是关于同一家族的高资源语言在提高低资源语言表现方面的作用。此外，实验结果还表明了ILS的倒退和域适应性的优势，以提高源和目标语言的翻译质量。使用所有这些关键方法，我们提出的模型在评估指标方面比基线模型更有效，即一组ILS的BLEU（双语评估研究）得分。

translated by 谷歌翻译

Part-of-Speech Tagging of Odia Language Using statistical and Deep Learning-Based Approaches

Tusarkanta Dalai , Tapas Kumar Mishra , Pankaj K Sa

分类：自然语言处理

2022-07-07

自动言论（POS）标记是许多自然语言处理（NLP）任务的预处理步骤，例如名称实体识别（NER），语音处理，信息提取，单词sense sisse disampigation和Machine Translation。它已经在英语和欧洲语言方面取得了令人鼓舞的结果，但是使用印度语言，尤其是在Odia语言中，由于缺乏支持工具，资源和语言形态丰富性，因此尚未得到很好的探索。不幸的是，我们无法为ODIA找到一个开源POS标记，并且仅尝试为ODIA语言开发POS标记器的尝试。这项研究工作的主要贡献是介绍有条件的随机场（CRF）和基于深度学习的方法（CNN和双向长期短期记忆）来开发ODIA的语音部分。我们使用了一个公开访问的语料库，并用印度标准局（BIS）标签设定了数据集。但是，全球的大多数语言都使用了带有通用依赖项（UD）标签集注释的数据集。因此，要保持统一性，odia数据集应使用相同的标签集。因此，我们已经构建了一个从BIS标签集到UD标签集的简单映射。我们对CRF模型进行了各种特征集输入，观察到构造特征集的影响。基于深度学习的模型包括BI-LSTM网络，CNN网络，CRF层，角色序列信息和预训练的单词向量。通过使用卷积神经网络（CNN）和BI-LSTM网络提取角色序列信息。实施了神经序列标记模型的六种不同组合，并研究了其性能指标。已经观察到具有字符序列特征和预训练的单词矢量的BI-LSTM模型取得了显着的最新结果。

translated by 谷歌翻译

Design of Human Machine Interface through vision-based low-cost Hand Gesture Recognition system based on deep CNN with transfer-learning approach

Abir Sen , Tapas Kumar Mishra , Ratnakar Dash

分类：计算机视觉

2022-07-07

在这项工作中，提出了基于实时手势识别系统的实时手势识别系统界面（HCI）。该系统由六个阶段组成：（1）手势分割，（3）使用转移学习方法使用六个预训练的CNN模型，（4）构建交互式的人机界面，（（ 5）开发手势控制的虚拟鼠标，（6）使用卡尔曼过滤器来估计手部位置，因为指针的平滑度得到了改善。六个预训练的卷积神经网络（CNN）模型（VGG16，VGG19，RESNET50，RESNET101，INCEPTION-V1和MOBILENET-V1）已用于对手势图像进行分类。三个多级数据集（两个公开和一个自定义）已用于评估模型性能。考虑到模型的性能，已经观察到，与其他五个预训练的模型相比，Inception-V1在准确性，精度，召回和F-SCORE值方面表现出了更好的分类性能。手势识别系统已扩展并用于控制多媒体应用程序（例如VLC播放器，音频播放器，文件管理，播放2D Super-Mario-Bros游戏等），并在实时场景中具有不同的自定义手势命令。该系统的平均速度已达到35 fps（每秒帧），满足实时场景的要求。

translated by 谷歌翻译

Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann , Annika Reinke , Vivienn Weru , Minu Dietlinde Tizabi , Fabian Isensee , Tim J. Adler , Patrick Godau , Veronika Cheplygina , Michal Kozubek , Sharib Ali

分类：计算机视觉 | 机器学习

2022-12-16

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.

translated by 谷歌翻译

Review of Methods for Handling Class-Imbalanced in Classification Problems

Satyendra Singh Rawat , Amit Kumar Mishra

分类：机器学习

2022-11-10

Learning classifiers using skewed or imbalanced datasets can occasionally lead to classification issues; this is a serious issue. In some cases, one class contains the majority of examples while the other, which is frequently the more important class, is nevertheless represented by a smaller proportion of examples. Using this kind of data could make many carefully designed machine-learning systems ineffective. High training fidelity was a term used to describe biases vs. all other instances of the class. The best approach to all possible remedies to this issue is typically to gain from the minority class. The article examines the most widely used methods for addressing the problem of learning with a class imbalance, including data-level, algorithm-level, hybrid, cost-sensitive learning, and deep learning, etc. including their advantages and limitations. The efficiency and performance of the classifier are assessed using a myriad of evaluation metrics.

translated by 谷歌翻译

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

分类：自然语言处理

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.

translated by 谷歌翻译

UATTA-ENS: Uncertainty Aware Test Time Augmented Ensemble for PIRC Diabetic Retinopathy Detection

Pratinav Seth , Adil Khan , Ananya Gupta , Saurabh Kumar Mishra , Akshat Bhandari

分类：计算机视觉 | 人工智能 | 机器学习

2022-11-06

Deep Ensemble Convolutional Neural Networks has become a methodology of choice for analyzing medical images with a diagnostic performance comparable to a physician, including the diagnosis of Diabetic Retinopathy. However, commonly used techniques are deterministic and are therefore unable to provide any estimate of predictive uncertainty. Quantifying model uncertainty is crucial for reducing the risk of misdiagnosis. A reliable architecture should be well-calibrated to avoid over-confident predictions. To address this, we propose a UATTA-ENS: Uncertainty-Aware Test-Time Augmented Ensemble Technique for 5 Class PIRC Diabetic Retinopathy Classification to produce reliable and well-calibrated predictions.

translated by 谷歌翻译

Machine Learning based Extraction of Boundary Conditions from Doppler Echo Images for Patient Specific Coarctation of the Aorta: Computational Fluid Dynamics Study

Vincent Milimo Masilokwa Punabantu , Malebogo Ngoepe , Amit Kumar Mishra , Thomas Aldersley , John Lawrenson , Liesl Zulke

分类：机器学习

2022-09-19

主动脉（COA）患者特异性计算流体动力学（CFD）研究的目的 - 在资源约束设置中的研究受到可用成像方式和速度数据采集的可用成像方式的限制。多普勒超声心动图被视为合适的速度获取方式，因为其可用性和安全性较高。这项研究旨在调查经典机器学习（ML）方法的应用，以创建一种适当且可靠的方法，用于从多普勒超声心动图图像中获得边界条件（BCS），用于使用CFD进行血液动力学建模。方法 - 我们提出的方法结合了ML和CFD，以模拟感兴趣区域内的血流动力学流动。该方法的关键特征是使用ML模型来校准CFD模型的入口和出口边界条件（BCS）。 ML模型的关键输入变量是患者心率，因为这是研究中测得的血管的时间变化的参数。在研究的CFD组件中使用ANSYS Fluent，而Scikit-Learn Python库则用于ML分量。结果 - 我们在干预前对严重COA的真实临床案例进行了验证。将我们的模拟的最大缩回速度与从研究中使用的几何形状获得的患者获得的测量最大骨质速度进行了比较。在用于获得BCS的5 mL模型中，顶部模型在测得的最大骨质速度的5 \％之内。结论 - 该框架表明，它能够考虑在测量之间考虑患者心率的变化。因此，当在每个血管上缩放心率时，可以在生理上逼真的BC计算，同时提供合理准确的溶液。

translated by 谷歌翻译

SOLBP: Second-Order Loopy Belief Propagation for Inference in Uncertain Bayesian Networks

Conrad D. Hougen , Lance M. Kaplan , Magdalena Ivanovska , Federico Cerutti , Kumar Vijay Mishra , Alfred O. Hero III

分类：人工智能 | 机器学习 | (统计)机器学习

2022-08-16

在二阶不确定的贝叶斯网络中，条件概率仅在分布中已知，即概率上的概率。Delta方法已应用于扩展精确的一阶推理方法，以通过从贝叶斯网络得出的总和产物网络传播均值和方差，从而表征了认知不确定性或模型本身的不确定性。另外，已经证明了Polytrees的二阶信仰传播，但没有针对一般的定向无环形结构。在这项工作中，我们将循环信念传播扩展到二阶贝叶斯网络的设置，从而产生二阶循环信念传播（SOLBP）。对于二阶贝叶斯网络，SOLBP生成了与Sum-Propoduct网络生成的网络一致的推论，同时更加有效且可扩展。

translated by 谷歌翻译